Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis

نویسندگان

  • Jeffrey T Leek
  • John D Storey
چکیده

It has unambiguously been shown that genetic, environmental, demographic, and technical factors may have substantial effects on gene expression levels. In addition to the measured variable(s) of interest, there will tend to be sources of signal due to factors that are unknown, unmeasured, or too complicated to capture through simple models. We show that failing to incorporate these sources of heterogeneity into an analysis can have widespread and detrimental effects on the study. Not only can this reduce power or induce unwanted dependence across genes, but it can also introduce sources of spurious signal to many genes. This phenomenon is true even for well-designed, randomized studies. We introduce "surrogate variable analysis" (SVA) to overcome the problems caused by heterogeneity in expression studies. SVA can be applied in conjunction with standard analysis techniques to accurately capture the relationship between expression and any modeled variables of interest. We apply SVA to disease class, time course, and genetics of gene expression studies. We show that SVA increases the biological accuracy and reproducibility of analyses in genome-wide expression studies.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Summary and discussion of: “Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis”

Gene expression study is well known to focus on finding association between expression levels of particular genes and some interesting variables, for example, a disease state. In such studies, besides the primary variable of interest, some other covariates are usually measured and included in the model of association tests. However, it is not possible to measure all the variables related to gen...

متن کامل

Surrogate variable analysis using partial least squares (SVA-PLS) in gene expression studies

MOTIVATION In a typical gene expression profiling study, our prime objective is to identify the genes that are differentially expressed between the samples from two different tissue types. Commonly, standard analysis of variance (ANOVA)/regression is implemented to identify the relative effects of these genes over the two types of samples from their respective arrays of expression levels. But, ...

متن کامل

Genetic polymorphism and expression analysis of cMBL gene in Iranian native and commercial chickens

The aims of this study were to compare the promoter sequence of the mannose-binding lectin (cMBL) gene in Iranian native and commercial chicken strains; as well as to compare the cMBL gene expression in crossbred and inbred chickens. In total 79 native (Western Azerbaijan native fowls, WANF) and 49 commercial (Arian Commercial Strain, ACS) birds were reared as parents under same management prac...

متن کامل

I-43: Identification of SOX3 as an XX MaleSex Reversal Gene in Mice and Jumans

Background: Mammals utilise an XX/XY system of sex determination in which the Y-linked gene SRY (Sexdetermining region Y) exerts a dominant masculinising influence on sexual development. Sex chromosome homology and comparative sequence studies suggest that SRY evolved from the related SOX3 gene on the X chromosome, although there is no direct functional evidence to support this hypothesis. The ...

متن کامل

Systematic enrichment analysis of microRNA expression profiling studies in endometriosis

Objective(s): The purpose of this study was to conduct a meta-analysis on human microRNAs (miRNAs) expression data of endometriosis tissue profiles versus those of normal controls and to identify novel putative diagnostic markers. Materials andMethods: PubMed, Embase, Web of Science, Ovid Medline were used to search for endometriosis miRNA expression profiling studies of endometriosis. The miRN...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PLoS Genetics

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2007